Picture for Xin Zhou

Xin Zhou

Singapore Management University

USS-Nav: Unified Spatio-Semantic Scene Graph for Lightweight UAV Zero-Shot Object Navigation

Add code
Feb 03, 2026
Viaarxiv icon

SP^2DPO: An LLM-assisted Semantic Per-Pair DPO Generalization

Add code
Jan 29, 2026
Viaarxiv icon

SLM-SS: Speech Language Model for Generative Speech Separation

Add code
Jan 27, 2026
Viaarxiv icon

USE: A Unified Model for Universal Sound Separation and Extraction

Add code
Dec 24, 2025
Viaarxiv icon

Flying in Clutter on Monocular RGB by Learning in 3D Radiance Fields with Domain Adaptation

Add code
Dec 19, 2025
Viaarxiv icon

Step-GUI Technical Report

Add code
Dec 19, 2025
Figure 1 for Step-GUI Technical Report
Figure 2 for Step-GUI Technical Report
Figure 3 for Step-GUI Technical Report
Figure 4 for Step-GUI Technical Report
Viaarxiv icon

VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments

Add code
Dec 19, 2025
Figure 1 for VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments
Figure 2 for VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments
Figure 3 for VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments
Figure 4 for VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments
Viaarxiv icon

SAM 2++: Tracking Anything at Any Granularity

Add code
Oct 22, 2025
Viaarxiv icon

SecureAgentBench: Benchmarking Secure Code Generation under Realistic Vulnerability Scenarios

Add code
Sep 26, 2025
Viaarxiv icon

SoccerNet 2025 Challenges Results

Add code
Aug 26, 2025
Viaarxiv icon